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Abstract 

This work concerns the behavior of “good” (capacity achieving) codes in several multi-user settings 
in the Gaussian regime, in terms of their minimum mean-square error (MMSE) behavior. The settings 
investigated in this context include the Gaussian wiretap channel, the Gaussian broadcast channel (BC) 
and the Gaussian BC with confidential messages (BCC). In particular this work addresses the effects of 
transmitting such codes on unintended receivers, that is, receivers that neither require reliable decoding 
of the transmitted messages nor are they eavesdroppers that must be kept ignorant, to some extent, of 
the transmitted message. This work also examines the effect on the capacity region that occurs when we 
limit the allowed disturbance in terms of MMSE on some unintended receiver. This trade-off between 
the capacity region and the disturbance constraint is given explicitly for the Gaussian BC and the secrecy 
capacity region of the Gaussian BCC. 


I. Introduction 

In this work we consider the scalar Gaussian channel with various requirements on transmission. The 
scalar Gaussian channel is a degraded channel, meaning that a receiver at a higher signal-to-noise ratio 
(SNR) is a stronger receiver and can obtain all the information that a weaker receiver ( i.e ., a receiver 
at a lower SNR) can. The basic requirement is that of reliable communication. This requirement defines 
the Gaussian point-to-point channel, the capacity of which has been derived by Shannon in his seminal 
paper [1], In [2] and [3] the behavior of the input-output mutual information, at every SNR, assuming 
the input is a codeword from a “good” codebook sequence (a capacity achieving code sequence), has 
been investigated. It was shown that in the limit, as blocklength n goes to infinity, this quantity follows 
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the behavior of an independent and identically distributed ( i.i.d .) Gaussian input to the channel, up to 
the SNR at which reliable decoding is possible. This property is also related to [4] where it was shown 
that the output statistics of a “good” code sequence approaches the output distribution produced by the 
i.i.d. capacity achieving input distribution. It follows that the behavior of the minimum mean-square 
error (MMSE) when estimating a codeword from a “good” codebook sequence from the channel output 
at different SNRs follows the MMSE of an i.i.d. Gaussian input to the channel [2], [3]. As such the 
effect of a “good” code on receivers at lower SNRs in terms of the MMSE is well defined. More recently 
the effect of “bad” code sequences has been investigated [5], that is, reliable codes of lower rates than 
the capacity. More specifically, the exact minimal MMSE that can be obtained by a code of rate R was 
derived, and was shown to be that obtained by a supeiposition code sequence. This gives some engineering 
intuition into the efficiency of the Han-Kobayashi achievable region to the two-user Gaussian interference 
channel [6]. 

When considering potential receivers of the transmission at lower SNRs, an immediate requirement 
is confidentiality. This is exactly the degraded Gaussian wiretap channel, where we have one legitimate 
receiver, which requires reliable communication, and one eavesdropper, which must be kept as ignorant as 
possible with respect to the transmitted message. The degraded wiretap channel was presented by Wyner 
[7], who showed that secrecy can be obtained simply by taking advantage of the physical layer - the 
channel to the legitimate receiver as compared to the channel of the eavesdropper. Since this initial work 
many have investigated the model and important results have been obtained: specifying the results for 
the Gaussian model [8], extending to the general non-degraded channel [9], extensions to the Gaussian 
multiple-input/multiple-output (MIMO) channel, independently in [10], [11] and in [12] (for a more 
detailed review of the topic see [13] and [14] and references therein). Recently, more attention has been 
given to the design of “good” codes for the wiretap channel. This was first done for the discrete, degraded, 
wiretap channel [15], [16], [17]. Tyagi and Vardi have shown a construction for the scalar Gaussian 
wiretap channel [18]. The approach in both cases was to use existing capacity approaching codes for 
the point-to-point channel to construct from them codes that comply with the secrecy constraint. More 
recently, Chou and Bloch considered the discrete broadcast channel (BC) with confidential messages and 
developed a low-complexity polar code scheme [19]. 

In this work we continue to examine the scalar Gaussian wiretap channel and specifically the properties 
of the MMSE function of specific families of codes for this setting. This analysis proves known “rules 
of thumb” in the design of such codes, and extends upon them to provide a good understanding to what 
is expected from a “good” code for this channel. Moreover, similar to the analysis of optimal point-to- 
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point codes done in [2] and [3], such an understanding allows us to determine the effect/disturbance of 
these codes on other possible unintended receivers that arc neither eavesdroppers nor require the reliable 
decoding of the message. 

A different requirement to consider is reliable communication to an additional, weaker, receiver on top 
of the reliable communication to the stronger receiver. This defines the degraded Gaussian BC. The BC 
was originally presented by Cover [20], who also defined the degraded BC, and conjectured its capacity 
region. The achievability proof of the degraded BC was presented by Bergmans [21], while the converse 
proof was derived by Gallager [22], A converse proof for the Gaussian BC was derived in parallel by 
Bergmans [23] using the entropy power inequality (EPI). An alternative proof based on the single-letter 
expression and using the I-MMSE relationship was presented in [24]. A comprehensive survey of the 
BC can be found in [25] or [26]. 

In [3] the behavior of the MMSE function of a typical code sequence from a hierarchical code ensemble, 
designed to achieve capacity of the Gaussian BC, has been derived, via statistical physics methods. In 
this work we extend upon the above results and show that any “good” code sequence for the degraded 
Gaussian BC exhibits the same behavior for the mutual information and MMSE functions. This behavior 
is that of a capacity achieving supeiposition code sequence, but holds for any “good” code sequence, that 
is, also code sequences designed using the “Dirty Paper Coding” (DPC) approach [27], [28]. As such, 
the full effect of “good” code sequences in term of the mutual information and MMSE functions is well 
defined. Moreover, we provide necessary and sufficient conditions for reliable decoding in general and 
for “good” code sequences for the degraded scalar Gaussian BC, specifically. These conditions are given 
in the form of properties on the MMSE and conditional MMSE functions. 

The two requirements mentioned above, that is, confidentiality and an additional weaker receiver, can 
be combined into a degraded Gaussian BC with confidential messages (BCC). This problem has been 
considered already in the work of Csiszar and Korner [9], as the reliable rate to the weaker receiver can 
be considered as a rate of a common message (due to the degraded nature of the channel). Csiszar and 
Korner [9] provide a full single-letter solution for the rate equivocation region of this problem. 

In this work we explore the case of maximum level of equivocation (secrecy) for the message sent to 
the stronger receiver in the BCC. We distinguish between the cases of maximum possible rate for this 
message and the case of complete secrecy. We show that in both cases the MMSE when estimating the 
codeword from the output follows that of the Gaussian BC, and the difference in the capacity region is 
due to the different behavior of the other relevant MMSE functions. 

So far we have mentioned two types of requirements: reliable communication and confidentiality 
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(equivocation). Another requirement concerns unintended receivers that arc not interested in the trans¬ 
mission ( i.e., do not require reliable communication) and are also not potential eavesdroppers (■ i.e., so 
no confidentiality is required). As shown in [5] an MMSE constraint at such receivers provides useful 
insights into the disturbance that the transmission imposes on these types of receivers. With respect to this 
we examine two problems: the capacity region of the degraded Gaussian BC given an MMSE constraint 
and the secrecy capacity region of the degraded Gaussian BCC given an MMSE constraint. In both 
problems we need to distinguish between two cases depending on the SNR of the unintended receiver. 
We show that the capacity achieving scheme in both is based on supeiposition, whereas in the Gaussian 
BCC, depending on the SNR of this receiver, we can obtain capacity by using either a supeiposition 
code sequence for the wiretap code sequence to the stronger receiver, or a supeiposition code sequence 
to the weaker receiver. 

As this work concerns the MMSE performance of code sequences, we use the fundamental relationship 
between information theory and estimation theory, namely the so called I-MMSE relationship [29]. As we 
discuss “good” code sequences, we consider the behavior of the MMSE function in the limit as n — > oo. 
This limit might not always exist; however if we restrict ourselves to information stable distributions 
(for which the information rate is defined), we can discuss the lim sup of the MMSE function. We show 
that the relevant properties used in the context of the limit can also be considered with regal'd to the 
lim sup. Specifically, this holds for the “single crossing point” property, originally derived in [24], which 
examines the difference between the MMSE function assuming a Gaussian input distribution and the 
MMSE function of any arbitrary input distribution. This property shows that this difference has at most a 
single crossing of the horizontal axis, meaning that if for some SNR the two MMSE functions are equal, 
for any higher SNR the MMSE function of the Gaussian input will attain a higher value (assuming the 
two are not equal for all SNR, which occurs only when both have the same Gaussian input distribution). 
In this paper we extend the simplest vector version of this property, derived in [30], to a conditioned 
version. 

Notation: We denote a random variable using uppercase letter, such as X. Its realizations are denoted 
using a lower case letter {e.g., x ). We denote a length n random vector with bold uppercase letter and 
the subscript n {e.g., X n ). Specific realization of the random vector are denoted using the equivalent 
lower case {e.g., x n ). When we consider quantities after taking the dimension to infinity, n —>■ oo, we 
remove all subscripts n. The trace operator is denoted by Tr and the transpose operator is denoted by 
(•) T . The standard vector norm is denoted by || • ||. The expectation operator is denoted by E and the 
derivative of some function with respect to one of its parameter, for example 7 , is denoted by D 7 . 
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The paper is structured as follows: we begin with some preliminary results and definitions in Section 
II. The Gaussian wiretap problem is discussed in Section III. The Gaussian BC problem is discussed in 
Section IV. The Gaussian BCC is discussed in Section V, and the disturbance constraints are considered 
in Section VI. We then conclude and summarize the paper. 

II. Preliminary Definitions 

In this section we provide the central definitions and results that will be used throughout the paper. 

A. The Code Sequences 

In this work we split the definition of a code sequence and distinguish between the set of codewords and 
the mapping of messages to these codewords. We define a codebook sequence C = {C n },^ =1 where C„ 
is a set of length-n codewords denoted as the random vector X„ with realizations denoted as x n . These 
codewords must comply with a power constraint. With no loss of generality we assume the following: 

- |j x n |j 2 < 1, \/x n G C n . (1) 

n 

We further define a mapping sequence / = {f n }n Li where f n is a mapping from a message W to a 
codeword of length-n, i.e., 

fn(W) = x n . (2) 

The standard definition of a code sequence is a pair (C, /). Such a pair is said to have rate R if the message 
W is one of {1,2,..., 2 " R } (assumed uniformly distributed over the set) and is mapped to a codeword 
x n G C n . Note that these mappings are not necessarily deterministic, specifically if confidentiality is 
required then stochastic encoding is necessary [14, Section 3.4.1], in which case, for each message Wi 
there is a subset Si C C n such that f n (Wi ) G Si. These subsets arc disjoint in order to allow reliable 
communication over the channel. 

The code sequence pairs (C, /) considered in this paper arc assumed to be information stable, meaning 
all mutual information rates are assumed to converge as n — > oo. 

B. The 1-MMSE Approach 

The main approach used here is the I-MMSE approach, that is to say that we make use of the 
fundamental relationship between the mutual information and the MMSE in the Gaussian channel and its 
generalizations [29], [31], [32], [33]. Even though we arc examining scalar settings, the n-dimcnsional 
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version of this relationship is required since we are looking at the transmission of length-n codewords 
through the channel. In our setting the relationship is as follows: 

1 _ i rsnr 

-I(X n ;Vs™X n + Nn) = - / MMSE( c ^)( 7 )d 7 (3) 

n ^ Jo 

where N n is standard additive Gaussian noise, X n is the input signal of any arbitrary distribution (as 
long as the above mutual information is finite [34]), MMSE lY '" /J (7) = 7 Tr(E x „( 7 )) and E x „( 7 ) is 
the MMSE matrix defined as follows: 

Ex„( 7 ) = E{(X n - E {X n | V7 X n + N n })(X n -E{X n \ ^X n + N n }) T }. (4) 

Note that in our setting the distribution of X n is defined by the mapping f n and the assumption of 
uniformly distributed messages, W. 

The I-MMSE relationship has also been extended to conditional cases [24]. Consider a joint distri¬ 
bution over (X n , U ) independent of the additive Gaussian noise. Denote the random vector X n/U with 
distribution P x \u=u- Applying the I-MMSE relationship we have 

1 1 /" snr 

-/ (X n y,VsmX ntU + N n ) = - MMSE^^)( 7 |f7 = u)d 7 (5) 

n 4 Jo 

where MMSE ( ^ c ' ,l ’A)( 7 |f7 = u) = ^Tr(E Xn ( 7 , u)) and 

E Xn ( 7 , u) = E{(X n , u - E {X n , u | ^yX n , v + N n })(X n , u - E {X n , u \ yfiX n , u + N n }) J }. (6) 

Since (X n , U) is independent of the noise we have that 

E xj 7l «) = E{(X n -E{X n \^X n + N n ,U = u}) 

(X n - E {X n | X n + N n , U = u}) T \U = u}. (7) 

Taking the expectation over U results with 

E {E Xn ( 7 , U)} = E{ (X n - E {X n | ^X n + N n , U})(X n - E {X n \ ^,X n + N n , U}) T } (8) 

and MMSE'^’^E^If/j = E { ^Tr(E Xn ( 7 , (7))}. Moreover, this independence also allows us to write 
(5) as 

i _ i rsnr 

-l(X n -V^X n + N n \U = u) = - / MMSE< c »>'»>( 7 |[7 = u)d 7 . 

n 4 7 0 


( 9 ) 
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Taking the expectation on both sides of (9) with respect to U we obtain the following conditioned version 
of the I-MMSE relationship: 

1 _ 1 fS nr 

-/ (X n ; v^Xn + N n \U) = - / MMSE( c ^( 7 |[/)d 7 . (10) 

n * Jo 

We now take the limit as n —> oo on both sides of the I-MMSE relationship (3). In order to simplify 
notation the quantities at the limit, as n —> oo are simply denoted by removing the subscript n. For 
code sequences for which the relevant MMSE quantities converge as n —> oo, the exchange of limit and 
integration is according to Lebesgue’s dominated convergence theorem [35], since the MMSE quantities 
are always bounded. Thus, we have 

I (X; x/inrX + N)= lim -I (X n ; y/^~rX n + N n ) 

v 7 n— >-oo 77, v 7 

i psr\r 

= lim - / MMSE (C "^")( 7 )d 7 

n—>oo 2 J q 

snr 

MMSE (C ’ /) ( 7 )d 7 . (11) 

When this is not the case (i.e., the MMSE does not converge) we can apply the reverse Fatou’s Lemma 
[35] and conclude that 



where 


lim 

n—» oo 


p snr 

MMSE( c " / ")(7)d7= / MMSE^( 7 ) sup d7 

Jo 


( 12 ) 


MMSE (c ^( 7 ),„ p = limsup MMSE( C " / ")( 7 ) (13) 

n—>oo 

due to the stability of the information rates. We show this precisely in Appendix A. In this case the 
MMSE quantities throughout the paper are the lim sup of the MMSE sequences, which always exist and, 
since the MMSE function is bounded, arc also finite. 

Another property used in the proof is a simple extension of the n-dimensional “single crossing point” 
property derived in [30] to the conditioned case considered here. For an arbitrary random vector X n we 
consider the following function 

q(X n \U,a 2 , 1 ) = —^-E(iTr(E* n ( 7 ,E0)l (14) 

1 + 0-7 [ n ) 

2 

= ^ „ - MMSE( C "^( 7 |(7). (15) 

1 + a -7 



Note that is the MMSE function assuming the input distribution is i.i.d. Gaussian with variance 

a 2 . Thus, the above function is the difference between the MMSE function of an i.i.d. Gaussian input and 
the averaged MMSE function assuming an arbitrary joint distribution over (X n , U), that is independent 
of the Gaussian noise. The following theorem is proved in Appendix B: 

Theorem 1 . The function 7 i-A q(X n \U, a 2 , 7 ), defined in (15), has no nonnegative-to-negative zero 
crossings and, at most, a single negative-to-nonnegative zero crossing in the range 7 G [0, 00 ). Moreover, 
let snro G [0,oo) be that negative-to-nonnegative crossing point. Then, 

1) q(X n \U,a 2 , 0) < 0. 

2) r/(X n |[7, cx 2 , 7 ) is a strictly increasing function in the range 7 G [0, snro). 

3) q(X n \U, a 2 , 7 ) > 0 for all 7 G [snro, 00 ). 

4) lim^oo q(X n \U, o 2 , 7 ) = 0. 

The above properties arc valid for all natural n. As explained above, if the limit of MMSE- C " /J (7l U) 
exists then we may simply take the limit of the function q(X n \U, cr 2 , 7 ). Alternatively, we show in Ap¬ 
pendix C that the “single crossing point” property holds also between and limsup MMSE^^^IC/). 

Finally, in this section we denoted the MMSE function using MMSE^"*^*^) (and its conditional 
version as MMSE^^^It/)) so as to emphasize the dependence on the code sequence. In the sequel 
we will simplify the notation and use the following: 

MMSE(X n ; 7 ) = MMSE (C " / " ) ( 7 ) (16) 

and MMSE(X n ; 7 |(/) for the conditioned version. 

III. The Gaussian Wiretap Channel 

A. Model and Definitions 

The Gaussian wiretap channel [7], over which length-n codewords are being transmitted, is denoted 
here as follows: 


X n — \ Sn r.y X ,j N ln 

Z n = ysnuX,, + N2 n (17) 

where X n is the length-n transmitted codeword. N ln and N- 2 n are standard additive Gaussian noise 
vectors that can be assumed independent of each other, and snr ;(/ and snr, are the SNRs at the two receivers. 
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We assume that snr y > snr z . Y represents the legitimate receiver, and Z represents the eavesdropper. A 
message W y is transmitted to Y and needs to be reliably decoded. The message must be kept secret, to 
some extent, from the eavesdropper. 

An (R, d) code sequence for this channel must reliably transmit the message, W y , of cardinality 2 nR , 
to the legitimate receiver, and also guarantee an equivocation rate of d. These requirements can be written 
as follows: 


lim l I(W y -Y n ) = R 

n—>-oo u 

lim -H(W v \Z n ) = d. (18) 

n—>oo n 


The basic requirement from the pair (R ,d) is that R > d. As stated in [9] these conditions do not 
specify the joint conditional distribution of the outputs given the channel inputs, but only the marginals 
of this joint conditional distribution have an effect. As such, we may assume, without loss of generality, 
N\ n = N 2 n = N n . The rate-equivocation region ( R. d) for this model was obtained by Wyner [7] and 
is as follows: 


f (R ,d): 


C = 


0 < R < / (X;Y) 
0 < d < R 


[ d<[I(X-Y)~I(X-,Z)} 


(19) 


In the Gaussian setting considered here the capacity region is well-known [8], and is obtained by the 
standard Gaussian input in the above expression. We wish to distinguish between three rate-equivocation 
phenomena, which we define next. 


Definition 1. A code sequence obtains complete secrecy when R = dr. 


Definition 2. A code sequence is optimally secure when 

d = d opt = lim - [I (X n -Y n ) - I (.X n ; Z n )\. 

n—Ti 

Definition 3. A code sequence ( R. ci) is at its maximum possible rate if 

R = Rmax = 1™ ~I {X n ‘, Yn') . 

n—>oo ft 


( 20 ) 


( 21 ) 


1 Complete secrecy is sometimes referred to in the literature as full secrecy [14] or perfect secrecy. 
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We further define two quantities. The maximum level of equivocation, 

dm ax = ^log(l + snr y ) - ^log(l + snr 2 ); 


( 22 ) 


and the capacity of the point-to-point channel, which is also the maximum possible rate for the Gaussian 
wiretap channel, for any possible d. 


C = ^log(l + snr y ). 


(23) 


B. Main Results 

In this subsection we provide our main results and observations on code sequences for the scalar 
Gaussian wiretap channel. The proofs of these results arc given in the next subsection. 

Theorem 2. Any code sequence for the Gaussian wiretap channel attains (R, d op t), that is, optimal 
security, if and only if 


MMSE(X; 7 |W 3/ ) = 0, 7 > snr 2 (24) 

and 

MMSE(X; 7 ) = 0, 7 > snr y (25) 

regardless of the rate of the code (R > d op t). 

Note that optimally secure codes do not necessarily contain any confidential information; for example, 
any “good” point-to-point code sequence to Z will also be optimally secure with d = d op t = 0 (as 
/ (X; Y) = I (X; Z)). Nonetheless, the importance of the above result is in emphasizing two properties 
of the family of code sequences that are optimally secure. First, at snr y we can fully decode the transmitted 
codeword and not only the message sent. Second, given the message, full decoding of the codeword will 
be possible at and above snr 2 . This observation supports the capacity achieving schemes in which the 
knowledge of the message leads to full decoding of the codeword by the eavesdropper. Note that the 
above is valid for any point (R, d op t) regardless of R (R > d op t). 

The above behavior can be further specified for the particular' case of d op t = d m ax as follows: 


Corollary 1. Any code sequence for the Gaussian wiretap channel attains (R, d max ), that is, the maximum 
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level of equivocation, if and only if 

MMSE {X\y\Wy) = 0, 7 > snr 2 (26) 

and 

{ T+-, 7 £ [ 0 , snr„) 

1+7 y> (27) 

0 , 7 > snr y 

regardless of the rate of the code (R > d max ). 

Note that the behavior in (27) is the behavior of any “good” point-to-point code sequence, C, to Y 
[2], [3]; however, only a one-to-one mapping over this codebook sequence leads to a maximum point- 
to-point rate. (27) also suggests that there are approximately 2 n 2 l °s( 1 + sni v) codewords in any codebook 
in the sequence of maximum equivocation codebooks. Note that this property defines a large family 
of code sequences which contains those for the Gaussian wiretap channel with (R, d max ). The idea is 
that the maximum level of equivocation determines M MSE(26; 7 ) for every 7 regardless of the rate. In 
other words, in order to obtain a maximum level of equivocation the codewords must resemble the i.i.d. 
Gaussian distribution over the channel (in terms of MMSE) up to an SNR of snr y . 

The additional condition given in (26) (or alternatively, in (24)) is required in order to fully define the 
sub-group of code sequences that are (R, c? max ) (or alternatively, (R, d op t)) codes for the Gaussian wiretap 
channel. Still, these conditions do not fully specify the rate of the code sequence, as the group contains 
codes of different rates R as long as R > c/ rnax (or alternatively, R > d op f). The immediate question that 
arises is: Can we find MMSE properties that will distinguish code sequences of different rates? The next 
theorem provides some insight into two families of code sequences, the complete secrecy, R = d, family 
and the maximum rate, R = R max , family. 

Theorem 3. A code sequence for the Gaussian wiretap channel is an (R, R) code, that is, it attains 
complete secrecy (R = d), if and only if it has the following behavior: 

MMSE(X; 7 |W y ) = MMSE(X; 7 ), V7G[0,snr^) (28) 

and 

1 

R = — MMSE(X; 7 ) - MMSE(X; 7 |VT y )d 7 . (29) 

^ ./cnr 
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A code sequence for the Gaussian wiretap channel attains R max if an d only if it has the following 
behavior: 

MMSE (X\y\W y ) = 0, V 7 > 0, (30) 

meaning a one-to-one mapping of the messages to codewords. 

Specifically, if R = ci max we also have the properties of Corollary 1. 

From Corollary 1 and Theorem 3 (see equation (52) in the proof of Theorem 3) we can conclude that 
when d = d m ax the rate of the code is determined solely by the behavior of MMSE (X ; 7 |VF y ) in the 
region of 7 € [0, snr 2 ) (and is either equal to or above the secrecy capacity rate). Note that this is not 
the case when d < d max since MMSE(JA; y\W y ) does not necessarily fall to zero at snr 2 (if d < d op t 
), and the behavior of M MSE(X; 7 ) is not predetermined. As shown in Theorem 3, when we require 
only complete secrecy, the expression for the rate is given by (29), a function of both MMSE(X; y\W y ) 
and MMSEfTC; 7 ) in the region of [snr 2 . snr, y ). These observations arc also depicted in Figures 1 and 
2 where we depict the behavior assuming a ci max code sequence. In Figure 1 we consider a completely 
secure code sequence, and in Figure 2 we consider the behavior once we increase the rate beyond the 
secrecy capacity. 

Another important thing to observe is that as the condition (30) is a necessary and sufficient condition 
it supports the necessity of a stochastic encoder for any code sequence for the Gaussian wiretap channel 
with R < R max (as shown in [14, Section 3.4.1] for a completely secure code for the discrete memoryless 
wiretap channel). This is due to the fact that 

H(X\Wy)>l(X;^/smX + N\Wy) (31) 

and 

1 f sm 

l(X;y/smX + N\W y ) = - MMSE(X; 7 |FF y )d 7 . (32) 

2 Jo 

Thus, condition (30) guarantees H (X\W y ) > 0 for any such code sequence. 

Note that a central conclusion from Corollary 1 is that any maximum equivocation code sequence has 
a “good” point-to-point codebook sequence. In such a case even when the rate is below capacity we 
can artificially complete the mapping of messages to codewords to a one-to-one mapping. This leads 
us to investigate the “good” point-to-point sequences. The following result basically restates the known 
property that such sequences are also (C, c? max ) achieving for the Gaussian wiretap channel. 
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signal-to-noise ration: 7 


Fig. 1. The above figure depicts the behavior of MMSE(X;7) as a function of 7 assuming ef ma x (in blue), the behavior of 
MMSE(X; "y\Wy) assuming complete secrecy (in red) and the behavior of MMSE(X; fiWy) assuming a “good” point-to-point 
code (in black). Finally, we mark twice the secrecy capacity rate as the region between MMSE(X; 7) and MMSE(X; fiWy ) 
(in magenta). 


Lemma 1. For any snK, G [0,snr y ) define 



(33) 


Any “good” codebook sequence for the Gaussian point-to-point channel to Y, meaning R = C, attains 
maximum equivocation for any snr' z , and is thus also a (C. d' riHX ) codebook sequence for the Gaussian 
wiretap channel (srir^, snr'l). 

It is well known that (C. d' mSLX ) is an achievable rate-equivocation pair for the Gaussian wiretap channel 
[7], [ 8 ], meaning that full point-to-point rate (capacity rate) does not need to be compromised for the sake 
of maximum confidentiality. This pair is attainable by keeping completely secure only part of the message, 
as shown in [9, Ex. a, pp. 408]. The above observation applies to any “good” codebook sequence. It is, on 
the one hand, a straightforward result, but, on the other hand, very vague, since the secured information 
is not defined. Note that we are only considering the codebook sequence without having any specific 
mapping over it. This observation is motivated by the equivalence between the rate-equivocation region 
and the secret-private region (in which one of the messages is completely secure and the additional rate is 
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signal-to-noise ration: 7 


Fig. 2. The above figure we depicts the behavior of MMSE(_Xi; 7 ) as a function of 7 assuming d max (in blue), the behavior 
MMSE(X; ~i\Wy) assuming complete secrecy (in red) and the behavior of MMSE(X; y| W y ) for some arbitrary code of rate 
above secrecy capacity and below point-to-point capacity (in black). We mark twice the rate as the area between MMSE(X; 7 ) 
and MMSE(X; 7 I W y ) (in magenta). 


for a private message) which has been shown in [9, Ex. c, pp. 413]. In our setting we consider any “good” 
point-to-point codebook sequence (with a one-to-one mapping), and do not claim only the existence of 
a “good” point-to-point sequence that is also (C, d( nax ). However, we do not show the existence of a 
specific mapping to conclude the definition of the code sequence (C. d' m . iX ). Such a mapping is equivalent 
to finding a mapping from every original message to a product of a completely secure message and a 
private message. We consider this in the next result. 

Theorem 4. For any “good” sequence for the Gaussian point-to-point channel to Y, meaning R = C, 
assume a partition of the codewords into 2 nrfm “ bins (of approximately equal size), denoted by W s . The 
next two claims are equivalent: 

1) W s is completely secure in the equivocation sense (Definition 1). 

2) Each bin, with messages denoted by W p , is a “good” sequence for a receiver at snr(,. 

According to the above result, given a “good” sequence, which is also a (C, d' mirA ) code for the 
Gaussian wiretap channel (Lemma 1), finding a mapping from W y to ( W s , W p ) (independent) such that 
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W s is completely secure is equivalent to splitting the codewords into sub-codes that arc optimal for the 
eavesdropper. This approach is exactly the one in Wyner’s original work [7], and also emphasized by 
Massey in [36], wherein the achievability proof the construction of the code sequence is such that the bins 
of each secure message are “good” code sequences to the eavesdropper (saturating the eavesdropper). 
The above claim extends this observation by claiming that any mapping of messages to codewords 
(alternatively, any binning of the codewords) that attains complete secrecy must saturate the eavesdropper, 
thus supporting the known achievability scheme of Wyner. The open question posed by the above is the 
existence of such a partition for any “good” point-to-point code sequence. Seemingly, that would be 
shown in the direct paid of the proof of the equivalence between the rate-equivocation region and the 
secure-private region shown in [9, Ex. c, pp. 413]; however the proof does not take any code sequence of 
a given rate-equivocation and shows that there exists a mapping to the corresponding secure-private pair 
code sequence. Alternatively, the proof relies on the achivability proof of the rate-equivocation region 
[9, Ex. a, pp. 408] which actually proves the achivability of a secure-private pair. This proof suffices as 
the goal is to show equivalence between the regions, which is weaker than claiming the existence of a 
mapping from one code sequence to the other. Moreover, note that the achivability proof relies on the 
Coding Stuffing Lemma [9, Lemma 3.18, pp. 334] which claims that for any set of codewords there exists 
a subset that is a disjoint union of sets. This disjoint union of sets is the partition of the codebook into 
bins denoting the secure message. However, the lemma claims the existence of such a subset and not that 
any subset is comprised of a disjoint union of subsets (and of course is limited to discrete memoryless 
channels (DMCs) and relies on properties of the image set size [9, Lemma 3.2, pp. 305]). To conclude 
this discussion, the question regarding the existence of such a partition is still an open one, and what 
we claim is only that given any partition that defines a completely secure message it must saturate the 
eavesdropper as done in Wyner’s achievability proof [7]. 

The above result can be extended. Looking at the proof of Theorem 4 one can observe that the 
fact that the code sequence achieves capacity has no significance in the proof of the equivalence. The 
important properties are that the code sequence attains maximum equivocation ri max and that we have 
either complete secrecy for the message W s or a “good” code sequence in each bin. Let us assume a 
secure-private message pair (W s , W p ) over an (R(a),c? max ) code sequence, where 

R(a) = d max + ~log(l + asnr^) (34) 

for some a 6 [0,1]. Lor such a code sequence we can state the following simple corollary (which relies 
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on Corollary 1, Theorem 3 and Theorem 4). 

Corollary 2. An (R(a),d max ) code sequence transmitting the message pair (W s ,W p ) ( independent ) has 
the following behavior: 

{ t^—, 7 6 [ 0 , snr 2 ) 

1+7 1 . (35) 

0, 7 > snr 2 

Thus, the equivalence of Theorem 4 holds also here - W s is completely secure if and only if the codewords 
in each bin construct a “good” point-to-point code sequence to the eavesdropper (note that for a < 1 
the mapping ofW p is not a one-to-one mapping over the “good” point-to-point code sequences). 

To conclude, we see that any d max code sequence is built on a “good” point-to-point codebook sequence, 
and that a partition of this codebook sequence to bins defines a completely secure message if and only if 
the bins themselves are codebook sequences that achieve capacity to the eavesdropper. This is regardless 
of the rate of the code sequence. 

C. Proofs for the Gaussian Wiretap Channel 

Proof of Theorem 2: The single-letter expression for the rate-equivocation region of a degraded 
channel (19) was given in [7] and extended to the more general case in [9]. Since we are examining the 
properties of length-n codebooks of the scalar Gaussian wiretap channel, which is always degraded, we 
use the limiting version of (19) (see the converse proof in [7] for details). 

Using the chain rule of mutual information we have 

I (X n , Wy- Y n ) = I {Wy- Y n ) + I (X n ; Y n \W y ) 

= I(X n -Y n ) + I(W y -Y n \X n ) 

= I(X n -Y n ) (36) 

where the last transition is due to the Markov chain relationship W y — X n — Y n . Thus, we have 

I (X n - Y n \Wy) = I {x n - Y n ) - I (Wy- Y n ) . (37) 

Similarly and due to the Markov chain condition W y — X n — Z n we also have 


I (X n - Z n \Wy) = I (X n - Z n ) - I (Wy-Z n ) . 


( 38 ) 
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Subtracting these two equations we obtain the following: 


I (X n ; Y n \Wy) - I (X n ; Z n \Wy) = I (X n ; Y n ) - I (X n ; Z n ) - I (W y ; Y n ) + I ( W y ; Z n ) . (39) 

Normalizing by n and taking n —>• oc we have due to the assumption of an (R, d op t) codebook (111) 

I{X-Y\W y )~l(X-Z\W y ) 

= I (X; Y) - I {X-Z) - R + R - d opt 

= I(X;Y)-I(X;Z)-d opt = 0. (40) 


Using the I-MMSE relationship [29] we can conclude that for any (R, d op t) codebook for the Gaussian 
wiretap channel 


MMSE(X;7|lUj / )d7 = 0 


MMSE(X; 7 |W y ) = 0, V 7 > snr*. 


(41) 


This proves (24). It remains to prove (25). This is a simple consequence of the above since we assume 
that reliable decoding of W y is possible at SNR at or above snr y ; thus V 7 > snr y 

MMSE(X; 7 ) = MMSE(X; 7 |lU y ) 

MMSE(X; 7 ) = 0. (42) 


This concludes the proof of the direct paid. 

For the converse part we now assume that (25) and (24) both hold for a codebook where messages 
W y € {1.2..... 2" r } arc mapped (not necessarily deterministically) to codewords X n . From (25) we 
can conclude reliable decoding of the messages at snr y since the MMSE when estimating the codeword 
X n is zero implies that the message can be reliably decoded. Thus, we can use (111) to define d id, < R). 
The Markov chain relationship W y — X n — Y n — Z n still holds and we begin from (39). Using (111) 
we have 


I (X; Y\W y ) - I (X- Z\W y ) = I (X; Y) - / (X; Z) - d (43) 


and we need to show that (25) and (24) guarantee d = d op t■ 
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Using the I-MMSE relationship on the left-hand-side of the above equation we have that 

MMSE(X; 7|TUj / )d 7 = / (X; Y ) - I (X; Z) - d. 

From (24) we can conclude that the left-hand-side is zero and 


1 /“ snr y 

2 ./cnr 


(44) 


d = I (X;Y) — I (X; Z) = d, 


opt • 


(45) 


This concludes our proof. ■ 

Proof of Corollary 1: In order to prove this corollary we need only to show that the maximum 
level of equivocation, c? max , is a specific case of d op t which also determines the exact behavior of 
MMSE(X| v /7X + N). 

Note that 


d < lim 1 [I (X n ; Y n ) - I (X n ; Z n )] 

n—too 71 

1 f snr y 1 1 

d<- MMSE(X; 7 )d 7 < -log (1 + snr^) — -log (1 + snr z ) = d max (46) 

^ Jsnr z A A 

where the last inequality is due to the fact that the MMSE function is upper bounded by the linear MMSE 
for any SNR. This leads to the following conclusion: if d = d, m:ix then we must have equality in the 
above equation, meaning d = dopt■ Thus, the properties of Theorem 2 define these codes “if and only 
if”. Moreover, since 

1 f snr « 1 1 

-J MMSE(X; 7 )d 7 = -log (1 + snr^) — -log (1 + snr,) (47) 

we can conclude that 


MMSE(X; 7 ) = V 7 € [0,snr, y ) (48) 

concluding the direct paid of the Corollary. As for the reverse direction, recall that the properties in 
Theorem 2 lead to (45), that is 

1 r snr y 

d = I(X:Y)-I(X:Z) = - MMSE(X;7)d 7 ; (49) 

^ J snr 2 

adding the additional property in this corollary leads to 


d = -log (1 + snr^) - -log (1 + sn f z) — ^max 1 


(50) 
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This concludes our proof. ■ 

Proof of Theorem 3: Note that due to the Markov chain condition W y — X n — \/smX n + N n we 
have that for any snr 

/ (W y - y/snrX + N) =1 ( X ; V^nX + N) - I ( X ; y'sdFX + N\W y ) (51) 

and using the I-MMSE relationship 

1 f snr 

I (W y -, y/snrX + N) = - [MMSE {X; 7 ) - MMSE (X; 7 ^)] d 7 . (52) 

2 Jo 

Notice that for any 7 

MMSE(X; 7 ) > MMSE (X; 7 |IE y ). (53) 

This is due to the concavity of the MMSE function in the input distribution [34, Corollary 1] (note that 
the result extends to the limit when n 00 simply by taking the limit in [34, Equation (10)]). The 
distribution of X can be written as P w , Px\vj, an d the right-hand-side is the expectation over 

MMSE (X w .]^\W y = wf) = MMSE(X Wi \ y /fX Wi + N,W y = w i ). (54) 

As such the integrand in (52) is non-negative for all 7 . 

If we require complete secrecy, we have that 


i(w y -,y/^r z x + N) = o 

sr\r z 

[MMSE (X; 7 ) - MMSE (. X ; 7 \W y )\ d 7 = 0. (55) 

Thus, we can conclude that 

MMSE (X; 7 ) = MMSE (X; j\W y ), V 7 C[ 0 ,snr*). (56) 

From the reliable decoding at Y and the above equality we can conclude that (29) holds. 

In the reverse direction, setting the above equality leads exactly to reliable communication at rate R with 
complete secrecy. 

If we require R max , then 

I (W y ; v/sbr^X + AT) = / (X; Y ) 




x /- snr » 

- [MMSE(X; 7 ) - MMSE(X; 7 |FF y )]d 7 
2 Jo 

from which we can conclude that 



MMSE(X; 7 )d 7 
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(57) 


MMSE (X; 7 |W y ) = 0, V 7 > 0 (58) 

as expected in a one-to-one mapping of message to codeword. This again holds in the reverse direction. 

Finally, if we further specify that d = d max the exact behavior is determined from Corollary 1. This 

concludes our proof. ■ 

Proof of Lemma 1: We first prove that any “good” point-to-point sequence is a ( C, d may ) sequence 
for the Gaussian wiretap channel. This is simply observed from (43) where the left-hand-side is zero due 
to the one-to-one mapping of the message to a codeword, and we remain with 

d = I(X;Y)-l(X-,Z') (59) 

which equals d' max since this is a “good” sequence and thus (27) holds [2], [3]. This concludes the proof. 


Proof of Theorem 4: Assume that the first claim holds and we have a partition to 2 nd ™** bins that 
is completely secure, meaning I (W s : Z’) = 0. Thus 


I (X; Z'\W S ) = I (X; Z r ) - I (W s ; Z’) 

= ^log(l + snr'J. (60) 

Due to the following upper bound 

I (X; Z’\W S = w) < 7 ,l°g(l + snr^) (61) 

we can conclude that for any W s = w, meaning, for any bin 

l(X-Z'\Ws = w) = ^log(l + snr(). (62) 

Finally, using the result of Corollary 1 we have that 

MMSE(X; 7 |VF s ) = 0, V 7 > snr^ (63) 


implying that given the bin, reliable decoding is possible at Z'. This concludes the first direction. The 
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other direction is even simpler since a “good” sequence in every bin means 

l(X-Z'\W s = w) = ^log(l + snr' 2 ). 

Thus, we can immediately conclude that I (W s ; Z') = 0 via (60). This concludes the proof 

IV. The Gaussian Broadcast Channel 

A. Model and Definitions 

Assume a scalar Gaussian BC, over which we transmit length-n codewords 

Y n — ^/snr yX n T N\ n 

Z n = ^/sntfiXn + N 2n (65) 

where X n is the length-n transmitted codeword. N ln and N 2 n are standard additive Gaussian noise 
vectors. Without loss of generality we assume that snr (/ > snr z , making Y the stronger receiver, which 
can decode both messages. As the capacity region of the above channel depends only on the marginals 
of the joint conditional distribution of the outputs given the input [26], we may assume, without loss of 
generality, N ln = N 2n = N n . 

An (R y ,R z ) code sequence, (C,/), for this channel must reliably transmit the message, W y , of 
cardinality 2 nRy , to Y and reliably transmit the message, W z , of cardinality 2" R % to Z. That is, there 
exists a sequence of encoding functions such that X n = f n (W y ,W z ), where W y £ {1,2,..., 2” R »} 
and W z £ {1,2,..., 2 nRt } such that the error probability when decoding W z from Z n and the error 
probability when decoding W y from Y n , go to zero as n —> 00 . These requirements can be written as 
follows: 

lim -I(Wy-Y n ) = Ry 
n^oo n y 

lim —I (W z ; Z n ) = R 2 . (66) 

n—¥ 00 Tl 

The capacity region of the channel is well known [26]: 

(R y , Rj.) : 

c = 0 < Ry <I{X-Y\U) ■ (67) 

0 < R z <I(U-Z) 

where U is an auxiliary random variable such that U — X — Y — Z forms a Markov chain. In the 
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Gaussian setting considered here the capacity region is well-known [23], and can is obtained by choosing 
X = U + V where V and U are independent Gaussian random variables with variances that sum to 
one (Bergmans [23] has provided a converse proof that does not rely on the single-letter expression [22] 
using the entropy power inequality (EPI). A simpler proof based on the single-letter expression using the 
EPI was shown in [37], and a proof using the I-MMSE relationship was given in [24]). 

Given these requirements and due to the data-processing lemma we have the following inequality 

lim 1 / (x n - Y n ) > R y + R Z = I (Wy, W z ; Y n ) . (68) 

n—>oo n 

Equality holds when we have a deterministic encoder. 

B. Main Results 

We first wish to understand what are the implications of reliable decoding in terms of the MMSE 
behavior. We consider any code sequence designed for reliable communication of the pair (W y ,W z ) to 
the two receivers. The sequences can be either “good”, capacity achieving, sequences or “bad” sequences, 
that do not necessarily achieve capacity. 

Theorem 5, Consider a code sequence, transmitting a message pair (W y , W z ), at rates (R, r R~), over 
the Gaussian BC. W z can be reliably decoded from Z if and only if 

MMSE(X; 7 |W Z ) = MMSE(X; 7 ), V 7 > snr z . (69) 

The above theorem formally states a very obvious observation which is that once W z can be decoded, it 
provides no additional information to the estimation of the transmitted codeword, beyond the information 
in the output. We strengthen this insight by proving that this property is sufficient for reliable decoding. 

In [34, Theorem 1] it was shown that the MMSE function is a concave function in the input distribution. 
Moreover, for the additive Gaussian channel is was shown [34, Theorem 2] that this is actually a strict 
concavity. However, these results were shown for finite dimension, and although the concavity result 
transfers quite simply to the limit, the condition for strict concavity does not. As such the above does 
not contradict these results. 

Our main result is an extension of the result given in [3], where it was shown that a typical code 
from the hierarchical code ensemble (which achieves capacity) designed for a given Gaussian BC has a 
specific MMSE behavior. We extend this result to any “good” code sequence as follows: 
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Theorem 6. Any “good” code sequence for the Gaussian BC, with a rate pair that can be depicted as 
follows: 

(Rv,R,)= (^oi(l + /3s„r v ),i|og(i^-)) (70) 

for some (3 <S [0,1], has the MMSE behavior of a Gaussian superposition codebook as n — > oo, that is 

snr G [0, snr 2 ) 

MMSE(X;snr) = < snr e [snr 2 , snr. y ) • (71) 

0, snr > snr y 

Note that the above behavior holds for any “good” code sequence for the Gaussian BC. This includes 
also codes designed for decoding schemes such as “dirty paper coding”, in which case the decoding at Y 
does not require the reliable decoding of the known “interference” (the part of the codeword that carries 
the information of 1-14), but simply encodes the desired messages against that “interference”. As such, 
one does not expect such a scheme to have the same MMSE behavior as a supeiposition code scheme, 
where the decoding is in layers: first the “interference” and only after its removal, the reliable decoding 
of the desired message. 

The above theorem considers only the codebook sequence, and not the mapping of messages to 
codewords. Moreover, from the proof one can see that (as known) a deterministic mapping is required 
for any “good” code sequence. 

Theorem 7. Any code sequence complying with (69) and (71) is a reliable code sequence for the Gaussian 
BC depicted in (65) and will have the following property: 

MMSE(X; 7 |W 2 ) > -J —, V 7 e[0,snr 2 ]. (72) 

1 + PI 

Moreover, such a code sequence is “good” if and only if on top of (69) and (71) it has a deterministic 
mapping from (W y ,W z ) to the transmitted codeword and 

MMSE(X; 7 | W z ) = —^—, V 7 G [0, snr 2 ], (73) 

1 + P7 

I.e., (72) is satisfied with equality. 

Figure 3 depicts the result of Theorem 7 for “good” code sequences. 



24 



Fig. 3 . In the above figure we consider the behavior of MMSE(.Xi;7) (in blue) and MMSE(X;7|W 2 ) (in red) required from 
a “good” code sequence for the Gaussian BC. Twice R 2 is marked as the area between these two function (in magenta). 


C. Proofs for the Gaussian BC 

Proof of Theorem 5: First notice that due to the data-processing lemma we have that 
I ( W z ; + N n ) < I (w z - V^x n + N n ) 

for any snr > snr z . Since we have reliable decoding of W z at Z and using Fano’s inequality we have 
that as n —> 00 


I(W z] ^GX + N) = R z (74) 

and 

l(W z -VsnrX + N)=I(W z -^F z X + N) (75) 

for all snr > snr z . 

Now, consider the chain rule of mutual information, 


l(X n ,W z -VsnvX n + N n ) 
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= I (w z - V^rX n + N n ) + I (X n ; VsnrX n + N n \W z ) 

= I (X n j y/srwX n + AT n ) + I (W z \ \J snrX n + iV n |X n ) 

= I (X n ; Vsnr^n + N n ) (76) 

where the first and second ttansitions both result from applying the chain rule of mutual information 
on I (X n , W z ; \/snrX n + 7V„). The last equality is due to the Markov chain relationship (W z , W y ) — 
X n — \JsmX n + N n . Thus, we have 

I (W z ; \ZsnrX n + N n ) = I (X n ; V^X n + N n ) - I (X n ; VsnrX n + N n \W z ) . (77) 


The above holds for all n. Taking into account the equality in (75), which holds when n —> oo, we have 
that for any snr > snr 2 


I (X; v^srirX + N) - I (X; v^X + N) = I (X; x/snrX + N\W Z ) - I (X; v'snrTX + N\W Z ) 

(78) 


which in terms of the I-MMSE relationship [29] gives us the following: 


MMSE(X; 7 )dy = / MMSE(X; 7 |W*)d 7 . 


(79) 


Due to the concavity of the MMSE in the input distribution [34] (which holds also in the lim or the 
limsup) we have that 


MMSE(X; 7 ) > MMSE(X; 7 |VE 2 ), V 7 > 0 


(80) 


and we can conclude that 


MMSE(X; 7 ) = MMSE(X; 7 |kE 2 ), V 7 > snr 2 . 


(81) 


This concludes the direct paid of the proof. 

For the converse, we assume that (69) holds. Applying the I-MMSE relationship on (77) we have that 


1 


I (W z ; y/snrX + N) = - / MMSE(X; 7 ) - MMSE(X; 7 | 


(82) 


Due to (69) we have that for every snr > snr 2 


/ (W z - v^FX + N) = I (W z ; ^ v /snr7X + N ). 


(83) 
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For snr —> oo we have that 


lirn I (W z ; VsnrX + N) = I (W z ; X) = H(W Z ) = R, 


As such we have that 


I (W z ; VsnrTX + N) = H(W Z ) = R, 

meaning reliable decoding of W z from Z. This concludes the proof. 
Proof of Theorem 6: Using (77) we have 


(84) 


I (X n ; Z n \W z ) = / (X n ; Z n ) - I (W z ; Z n ) 

= I (X n ; Z n ) - [H (W z ) - H (W z \Z n )] 

= I (X n ; Z n ) - nR z + H (W z \Z n ). 

Using Fano’s inequality and the trivial upper bound on the mutual information we have 

I (Xn, Z n \W z ) = I (X n - Z n ) - nR z + H (W z \Z n ) 

< ^log (1 + snr,) + H 2 (. P n (e )) + P n (e) log (2” R = - l) 

where Ho (•) denotes the binary entropy. Dividing the above by n and denoting 

1 


- nR z 


$n = - 

n 


H 2 (P n (e)) + P n (e)\og[T R > -1 


we have 


-I (X n ; Z n \W z ) < ~log (1 + snr 2 ) - R, + S n 
n 2 

where S n -> 0 as n -> oo. Since we assume (70), that is, 

R. = hog ( 1 + snr A , 

2 “ \ 1 + /Tsnr, / 

we have 


-I (X n ; Z n \W z ) < ~log (1 + (3snr z ) + S n . 
n 2 


(85) 


( 86 ) 


(87) 


( 88 ) 
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Recall the conditional I-MMSE relationship: 

1 1 f snr - 

-I(X n] Z n \W z ) = - MMSE(X n ; 7 |fU 2 )d 7 . (89) 

n 2 J 0 

Using this in (88) and taking the limit as n oo we obtain 

Snr 2 J 

MMSE(X; 7 |W 2 )d 7 < -log (1 +/3snr 2 ). (90) 

Due to the “single crossing point” property, given in Theorem 1, we can conclude that the crossing point, 
if it exists, occurred within [0, snr~), and thus, at snr 2 we have the following upper bound: 

MMSE(X; snr 2 |IU 2 ) < -——^-. (91) 

1 + psnr 2 

According to Theorem 5 

MMSE(X; snr 2 |IU 2 ) = MMSE(X; snr 2 ) (92) 

since W z can be reliably decoded from ^/sr\r z X + N. In [5, Theorem 3] the following optimization 
problem has been investigated 



max I (X; -y/snr yX + N ) (93) 

s-t. MMSE(X; snr 2 ) < -——^-. (94) 

1 + psnr 2 

It was shown that the solution is 


max/ (X; ysnr ~X + N) = ^log (1 + /3snr y ) + ^log • ( 95 ) 

Recall that we have a “good” code sequence for the Gaussian BC, meaning that 

R„ + R, = hog (1 + /Jsnr„) + hog • < 96 > 

Given the above and the data-processing inequality we can conclude that in order to obtain the sum- 
capacity the mapping must be deterministic and we must obtain the maximum for I (X; Asnr^X + TV). 
Thus, the capacity point of the Gaussian BC will have the following two properties: 



+ AT) 


MMSE(X; snr 2 ) 


^log(l + /3snr y ) + ^log 

l 

1 + /3snr 2 


/ 1 + snr 2 \ 
\ 1 + /5snr 2 J 


(97) 

(98) 
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and according to [5, Theorem 6] the MMSE and mutual information behavior of this code is fully defined 
(for every SNR) and is that of the Gaussian two-layered supeiposition codebook (optimally designed for 
the Gaussian BC with a rate splitting coefficient /?). Finally, note that we have shown that the mutual 
information / (X; ysnr y X + N) cannot exceed the sum-rate. This means that the encoder must be 
deterministic in order to obtain capacity. This concludes our proof. ■ 

Proof of Theorem 7: From the assumptions we first conclude that the code sequence is a reliable 
code sequence for the Gaussian BC. This is due to Theorem 5 that ensures the reliable decoding of 
W z from Z and the fact that MMSE(X; 7 ) is zero at snr y meaning the entire codeword can be reliably 
decoded from Y. Moreover, from Theorem 5 and the “single crossing point” property given in Theorem 
1 we can conclude that, since 

MMSE(X;snr 2 |IF 2 ) = —- (99) 

1 + psnr 2 

we must have that 

MMSE(X; 7 |W*) > —, V 7 G[0,snr,). (100) 

1 + PI 

It remains to show that the code sequence is “good”, if and only if, on top of (69) and (71) the code 
sequence has a deterministic mapping and we have equality in (100) for all 7 e [0, snr 2 ). 

If the code sequence is “good” then the rate pair is of the form shown in (70) and the behavior of 
MMSE(X; 7 ) is as proved in Theorem 6 (and the mapping must be deterministic). According to Theorem 
5 we also have (69). Due to the chain rule of mutual information (77) we have 

R z = I{W z -Z) 

= I (X; v^X + AT) - / (X; yifiTTX + N\W Z ) 

snr 2 

MMSE(X; 7 ) — MMSE(X; 7 |BA)d 7 (101) 

and thus 

snr. 1 

MMSE(X; 7 |W / z)d 7 = -log(l + /3snr 2 ). (102) 

Due to (100) this holds only if we have equality for all 7 e [0, snr 2 ). This proves the direct paid. 

As claimed above, conditions (69) and (71) suffice for the code sequence to be a reliable code sequence 
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for the Gaussian BC. Similarly to the approach used to obtain (101) we can obtain an expression for R (/ : 


R„ = - 


MMSE(X; 7 ) - MMSE(X; 7 |IU y )d 7 . 


(103) 


Due to (71) we have 


1 1 [ snr ‘ 

R. = -log(l + snr 2 ) - - / MMSE(X; 7 |IU 2 )d 7 
1 z Jo 


(104) 


and 


R :y = o lo § 


1 f (1 + snr 2 )(l + /3snr y )\ 1 


1 + /3 snr 2 


_ 1 |q / (l + snr 2 )(l + (3sm y ) \ 1 

2 ° S \ 1 + /3snr 2 / 2 


MMSE(X; 7 |W y )d 7 


MMSE(X; 7 |IU y )d 7 


(105) 


where the second equality in (105) is due deterministic mapping of messages to codewords. In such a 
case, since W z is reliably decoded at Z, the knowledge of W y is sufficient to obtain the exact transmitted 
codeword, meaning the MMSE function will go to zero. 

Using the chain rule of mutual information we also have 


l(W Z ,Wy-Z)=l(Wy,Z)+l(W Z -Z\Wy) 

= l(Wy-Z,W Z )+l(W Z -Z\Wy) 
= l(Wy-Z\W Z )+l{W Z -Z\Wy) 


(106) 


where the second transition is due to the reliable decoding of W z from Z and the final transition is due 
to the independence of the two transmitted messages. Using the assumption of a deterministic mapping 
of messages to codewords the above reduces to 


/ (X; Z) = I (X; Z\W z ) + I (X; Z\W y 


(107) 


Applying the I-MMSE relationship we obtain the following relation: 


1 rsnr z i r snr z 

MMSE(X; 7 )d 7 = - / MMSE(X; 7 |W 2 )d 7 + - / MMSE(X; 7 |IU y )d 7 . 

2 7o 2 J 0 


Note that this relationship holds only for the integration up to snr 2 due to the reliable decoding of W z at 
that point, and does not imply any such equality between the integrands (the MMSE quantities). Given 
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(71) and (73) we have that 


1 /- snr - 
2 Jo 


MMSE(X; 7 |W„)d 7 = - / MMSE(X; 7 )d 7 - - / MMSE(X; 7 |W 2 )d 7 


/ o 


/o 


= ~log(l + snr 2 ) - ~log(l + /3sm z ). 


(108) 


Substituting the above in (105) we obtain 


R y = -log(l + /3snr. y ) 


(109) 


thus concluding the proof. 


V. Gaussian BC with Confidential Messages 
A. Model and Definitions 

Combining the Gaussian wiretap channel and the Gaussian BC we obtain the degraded Gaussian BC 
with confidential messages (BCC). In this section we put together some of the observations regarding the 
Gaussian wiretap channel and the Gaussian BC given in the previous sections so as to conclude regarding 
the behavior of the relevant MMSE quantities of the Gaussian BCC. 

Consider a scalar additive Gaussian channel, over which length-n codewords are being transmitted. 
This is depicted as follows: 


Y n — sfism y X n -\- Ni n 

Z n = v / snr 7 .X n + N-2 n (110) 

where snr ; , < snr y . N\ n and are standard additive Gaussian noise vectors. As in the Gaussian 
wiretap channel, considered in Section III, and in the Gaussian BC, considered in Section IV, also in 
this model we may assume that N\ n = N\ n = N n without loss of generality [9]. 

An (R, y , R z ,d) code sequence for this channel must reliably transmit the message, W y , of cardinality 
2 nR y , to y while guaranteeing an equivocation rate of d and also reliably transmit the message, W z , of 
cardinality 2" R 7 to Z. These requirements can be written as follows: 

lim -I{Wy-Y n ) = R y 

n—Tl 

lim -H(Wy\Z n ) = d 
n—>oo Tl 
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lim —I (W z ; Z n ) = R z . (Ill) 

n—>oo 77 , 


The basic requirement from (Ry, R z , d) is that R y > d. 

The capacity region of the channel is well known [9, Theorem 1]: 

(Ry; Rzj d) '■ 

0<Ry + R z <I(X-Y\U) + I (17; Z ) 


C=l 


0 < d < Ry 


d<[I{X-Y\U)~I{X-Z\U)] 
0 < R z < I (U-Z) 


( 112 ) 


where U is an auxiliary random variable such that U — X — Y — Z forms a Markov chain. The capacity 
region of the Gaussian setting was considered in [38]. 

Finally, note that an optimally secure code sequence follows the same definition as given in Definition 
2; however since reliable decoding of W z is possible at Z this is equivalent also to 


d opt = I (X; Y\W Z ) - I (X; Z\W z ). 


(113) 


B. Main Results 

Putting together Theorem 3 and Theorem 5 we have the following result: 

Corollary 3. A code sequence transmitting (W y . W z ) at rates (Ry. R,) such that W y is reliably decoded 
by Y and completely secure from Z, and W z is reliably decoded by Z if and only if 


MMSE(X; 7 ) = 

1 MMSE(X; 7 |Wy), 7 e[0,snr z ) 

and 

(114) 

MMSE(X; 7 ) = 

y MMSE(X; 7 ITT), 7 G [snr 2 , snry) 

MMSE(X ;7 VFy) = MMSE(X; 7 | W z ), 

V 7 > snry. 

(115) 


When d < Ry (no longer complete secrecy) MMSE(X; 7 ) will no longer be equal to MMSE(X; 7 |VF y ) 
in the region of [0, snr z ). 

Note that we can also apply the result of Theorem 2 to the above setting, meaning that if we require 
optimal security we have that 


MMSE(X; 7 |Wy) = 0, V 7 > snr z 


The above corollary considers any code sequence with complete secrecy. Theorem 2 for the Gaussian 
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wiretap channel considers any code sequence with optimal security, and the combination of the two gives 
us the set of MMSE properties of code sequences that are both completely secure and optimally secure. 
Considering a more specific depiction of the MMSE quantities of “good” sequences we distinguish 
between two families: secrecy capacity sequences (which achieve both complete secrecy and optimal 
security), and optimally secure “good” sequences. In both cases we aim at the highest level of equivocation 
and try to maximize the rate pair; however in the first family we further restrict the rate R y to complete 
secrecy (R y = cl). Next we will show that both families have the same MMSE(X; 7 ) behavior, and that 
the second family is a subset of the family of “good” code sequences for the Gaussian BC. 


Theorem 8, Consider a code sequence transmitting ( W y , W z ) at rates (R ;/ . R z ) such that W y is reliably 
decoded by Y and completely secure from Z, and W z is reliably decoded by Z. Any “good” sequence 
has the following rate pair: 


(Ry, R z ) 



/ 1 + /3sm y \ 
\1 + /3snr 2 / 



f 1 + snr z \\ 
\l + !3snr z ) ) 


(116) 


for some 6 G [0,1], and has the following behavior: 


and 


MMSE(X; 7 ) = { 


1 

1+7’ 

P 

1 + 07 ’ 


0 , 


7 G [0, snr 2 ) 

7 G [snr z ,snr y ) , 

7 > snr.y 


MMSE(X; 7 IB 4 ) 


T+pfi 7 G [ 0 , snr y ) 
0 , 7 > snr y 


MMSE(X; y\W y ) 


T 77 , 7 G [0, snr z ) 
0 7 > snr 3 


(117) 


(118) 


(119) 


The above is attained by a superpositions code sequence , X = V + U. where V is an optimal Gaussian 
code of power 1—/3 and rate R :i (attained by considering U as additional Gaussian noise). U, independent 
of V, has power 3, and is a completely secure Gaussian codebook to Y with eavesdropper Z. and thus 
attain R y . 


Theorem 9. Consider a code sequence transmitting ( W y , W z ) at rates (R y , R z ) such that W y is reliably 
decoded by Y and optimally secure from Z. and W z is reliably decoded by Z. Any “good” sequence 
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has the following rate pair: 


(R y, R*) = o lo S (1 + 3 snr y), o'og 


for some 3 G [0,1], and has the following behavior: 

MMSE(X; 7 ) = < 


i+hi ’ 

o, 


1 + snr 2 
1 + 3 snr 2 


rp^» 7 G [0, snr 2 ) 
3 


7 G |snr 2 , snr y J 
7 > snr. y 


and differs from the relations in Corollary 3 only in the fact that 


( 120 ) 


( 121 ) 


MMSE(X; 7 |Wy) < MMSE(X; 7 ), 7 G[0,snr 2 ) (122) 

and 

(rril) = \[" <123) 

The above is attained by superposition, X = V + U, where V is an optimal Gaussian code power 1 — 3 
and rate R 2 . U, independent ofV, is a Gaussian optimally secure (maximum level of equivocation) code 
sequence for the wiretap channel to Y with eavesdropper Z. 


C. Proofs for the Gaussian BCC 

Proof of Theorem 8: We consider complete secrecy, R (/ = d and wish to examine a capacity rate 
pair. The rates can be written as follows: 


R y = I (Wy- Y) = I (.X; Y) — I (X; Y\W y ) 

1 r snr y 

= -/ [MMSE(X; 7 ) - MMSE(X;7|ir: y )]d7 

2 Jo 

1 r snr v 

= -/ [MMSE(X;7|W 2 ) - MMSE(X; 7 |W 2/ )]d7 (124) 

2 J snr = 

where in the last transition we have taken into account the fact that this is a complete secrecy code 
sequence (by taking MMSE(X; 7 ) = MMSE(X; 7 | Wy) for all 7 G [0, snr 2 )) and reliable decoding of 
W z at snr 2 (by taking MMSEfXly) = MMSE(X; y\W z ) for all 7 > snr 2 ) as given in Corollary 3). For 
the weaker receiver we have the following rate expression: 


R* = / (W 2 ; Z) = I (X; Z) — I (X; Z|W 2 ) 
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[MMSE(X; 7 ) - MMSE(X; 7 |W 2 )]d 7 

[MMSE(X; 7 |EE y ) - MMSE(X; j\W z )] d 7 . (125) 

In order to maximize both rates, we can first take MMSE(X; 7 |W J/ ) = 0 for all 7 > snr 2 which is 
exactly the case of optimal security (since we assume reliable decoding of W y at snr ;l/ , Theorem 2). We 
thus need to maximize the following pair: 

1 r sm y 1 rsnr v 

R y = ~ MMSE(X; 7 )d 7 = - / MMSE(X; 7 |IC 2 )d 7 

^ J snr 2 ^ Jsm z 

1 f snrz 

R z = - [MMSE(X; 7 |W y )-MMSE(X; 7 |W*)]d 7 . (126) 

1 Jo 

Now, given that R y = log 1 j * C)I some & £ [0,1], we can conclude from the “single crossing 
point” property, given in Theorem 1, that 

MMSE(X; 7 |W 2 )| > -^- 

v i/i ^| 7=sni - z - 1 + /3 snrz 

and thus, 

Rz < ^log(l + snr 2 ) - 7 ,log(l + /?snr 2 ). 

This is a converse proof. We first claim that this is indeed an achievable pair by using supeiposition. 
Consider codewords X = V + U where V has a power constraint of 1 — /3 and U, independent of 
V , has a power constraint of B. V is an optimal Gaussian code sequence to Z, and thus attains R 2 by 
considering U as additional Gaussian noise. U is a completely secure Gaussian codebook to Y with 
eavesdropper Z, and thus attain R :y . Since Y can first reliably decode V this concludes the achivability 
proof. 

The above converse provides us with a full depiction of the MMSE quantities, since in order to attain 
the above rates with equality we need that 

MMSE(X; 7 ) = —, 7 € [0, snr 2 ), (127) 

1 + 7 



MMSE(X; 7 ) = MMSE(X; 7 |W', / ), 7 G [0,snr 2 ) 


( 128 ) 



and 
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MMSE(X; 7 | W z ) = -^—, 7e[0,snr,) (129) 

1 + PI 

and, finally, that the code sequence is both completely secure and optimally secure; thus we have the set 
of relations given in Corollary 3 and Theorem 2. This concludes the proof. ■ 

Proof of Theorem 9: Observe that the rate pair given in the theorem is a capacity rate pair for 
the Gaussian BC, and thus according to Theorem 6 we have the exact behavior of MMSEfX: 7) for 
all 7 > 0. This proves (121). Due to reliable decoding of W z we have, according to Corollary 3, that 
MMSE(X; 7 ) = MMSE(X; 7 | W 2 ) for all 7 > snr 2 . It remains to show that this is achievable by an 
optimally secure code sequence for the Gaussian BCC. Consider a Gaussian codebook of power 1 — f3 
to be reliably decoded at Z (of rate |l°g (T~Tsn^)) an d superimpose on it, with the remaining power 
of (3, an independent Gaussian optimally secure (maximum level of equivocation) code sequence for the 
wiretap channel to Y with eavesdropper Z, meaning an 

( R „d) = (t|°g(l + /?snr,,),l|og + (BO) 

rate-equivocation pair. Thus, the rate-pair is achievable and is optimally secure: 

d = d opt = I(X-Y)-I(X-Z) 

= I(X-Y\W z )-I(X;Z\W z ) 

= ~log(l + /3sr\r y ) - ^log(l + (3sr\r z ). (131) 

The only MMSE quantity that is not fully defined is MMSEfX: 71W'y) in the region of [0,snrz). For 
this we can use R (/ and claim the following: 
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VI. Gaussian BC and BCC with MMSE Constraints 

In the previous sections we examined the effect of the transmission on unintended receivers at other 
SNRs. We have done so by fully depicting the behavior of the input-output mutual information and 
MMSE function for all SNRs. In this section we examine the trade-off between achievable rates and 
MMSE constraints. These constraints come to limit the amount of disturbance the transmission has 
on unintended receivers at other SNRs. These results are a direct extension of [5] where the trade-off 
between the point-to-point rate and MMSE has been investigated. It was shown that the optimal scheme 
attaining both the maximal possible point-to-point rate and complying with the MMSE constraint is 
supeiposition, supporting the good performance of the Han and Kobayashi scheme [6] in the two-user 
Gaussian interference channel. 

We first consider an MMSE constraint limiting the capacity region of the scalar Gaussian BC, depicted 
in (65). We then consider the effect of this constraint on the Gaussian BCC. More specifically, we consider 
a code sequence transmitting the message pair ( W y . W z ) such that W y must be reliably decoded by Y 
and completely secure from Z , and W z must be reliably decoded by Z. 

On top of these requirements we now consider an additional constraint: 

cx 

MMSE(X;snrJ < - (134) 

1 + asnr u 

for some a £ [0,1]. Of course, the effect of this constraints varies whether snr„ < snr 2 or snr„ £ 
(snr 2 , snr-y). 
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for (3 £ (a, 1]. The second region is achievable using a 3-layer superposition Gaussian code sequence. 
The message to be decoded from Z has 1 — (3 of the power. The second layer can already be reliably 
decoded at snr. u and has power (3 — a, and the final layer to be decoded from Y has power a. 
Assuming that snr M £ [0, snr 2 ) the capacity is again a union of two regions. The first is 


(Ry, R Z ) 



log(l + asnr.y) + A-log 


/ 1 + snr, u 
\1 + asnr u 


(l-A)^log 


/ 1 + snr M A A 
\1 + otsnr u ) ) 


(137) 


where A £ [0,1]. This region is obtained by time sharing. In both schemes we use a two-layer Gaussian 
superposition code sequence where the first layer, of power 1 — a, can be reliably decoded at snr u . 

The second region is 


(Ry, R z ) 


Ql°g(l + /3snry),^log 


/ 1 + snr u 1 + crsnr 2 \ \ 
V1 + asnr u 1 + j3sm z ) ) 


(138) 


where f3 £ [0, a). This region can be obtained by a 3-layer Gaussian superposition code sequence. The 
first layer, of power 1 — a, can be reliably decoded at snr u . The second layer, of power a — (3, can be 
reliably decoded by Z. The third layer, of power (3, is decoded from Y. 


An example for the above result is depicted in Figures 4 and 5, where we show the reduction of the 
capacity region as compared to the case with no MMSE constraint. 

Remark 1. The above regions of rate pairs of reliable communication over the Gaussian BC given an 
MMSE constraint at some lower SNR reduce to the result of [5, Theorem 3] where given such a constraint 
the goal was to maximize the point-to-point reliable rate. When snr u £ (snr 2 ,snry) this can be seen by 
enforcing R 2 = 0 in which case we obtain a two-layer superposition code to Y with a rate-splitting 
coefficient of a. When snr n £ [0, snr z ) we can either take A = 1 in the time sharing argument, or 
alternatively take f3 = a in (138) in which case we maximize the sum-rate, Ry + R .. In both cases we 
again obtain the same expression which can be obtained by a two-layer Gaussian superposition code 
with rate splitting coefficient of a. 


We now provide our main results on the BCC with an MMSE disturbance constraints. We consider 
a code pair sequence transmitting the message pair {W y ,W z ) such that W y must be reliably decoded 
by Y and completely secure from Z, and W z must be reliably decoded by Z. For these requirements 
Corollary 3 defines the properties of the MMSE functions. Moreover, from the proof we see that the 
optimal rate-pair is obtained when we further assume optimal security. The MMSE constraint (134) at 
snr u requires us to distinguish between the two cases of snr u £ [0, snr z ) and snr u £ (snr 2 ,snry) and 
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Fig. 4 . In the above figure we compare the capacity region of the Gaussian BC with no MMSE constraint (in blue) with the 
capacity region given the MMSE constraint (in red). In this example snr„ £ [ 0 ,snr 3 ). 


consider them separately in the next two theorems. 


Theorem 11. Consider a code pair sequence transmitting over the Gaussian channel the message pair 
( W y , W z ) such that W y is reliably decoded by Y and completely secure from Z, and W z is reliably 
decoded by Z. In addition we have the MMSE constraint (134) at snr„ £ (snr,,snr y ). The maximum 
rate pair is of the following form: 


(Rj/j R.z) 


fl (l + /3snr u \ 


1 , /1 + asnr y 

+ x lo S 7--- 

2 Vl + asnr u 



/ 1 + snr^ \\ 
\ 1 + /3snr 2 J J 


(139) 


for some 13 £ [a., 1]. These rate pairs are obtained by a 3-layer superposition code sequence, where the 
first layer is an optimal Gaussian code sequence to Z, of power 1 — f3, the second is a secrecy capacity 
Gaussian code sequence to the unin tended receiver at snr„ with eavesdropper Z, of power f3 — a, and 
the last layer is a secrecy capacity Gaussian code sequence to Y with eavesdropper Z, of power a. 
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Fig. 5. In the above figure we compare the capacity region of the Gaussian BC with no MMSE constraint (in blue) with the 
capacity region given the MMSE constraint (in red). In this example snr„ £ (snr 3 ,snr y ). 


These rates have the following behavior: 


and 


MMSE(X; 7 ) = 


MMSE(X; 7 |W 3 ) = 


1 

1 + 7 ’ 

7 E [0, sni%) 

8 

1+187 ’ 

7 E [snr 3 ,snr u 

a 

l+cry 5 

7 E [snr u , snr y 

0 , 

7 > snr^ 


T+fbfi 7e[0,snr u ) 

< 7 E [snr n , snr^) 

0, 7 > snr y 


(140) 


(141) 


and MMSE( X; y\Wy) is as in Theorem 8. 

In addition the rate pairs depicted in Theorem 8 for (3 E [0, a) and their MMSE behavior are also 
optimal and trivially comply with the constraint. 
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Theorem 12. Consider a code pair sequence transmitting over the Gaussian channel the message pair 
( W y ,W z ) such that W y is reliably decoded by Y and completely secure from Z, and W z is reliably 
decoded by Z. In addition we have the MMSE constraint (134) at snr u G [0, snr+ The maximum rate 
pair is of the following form: 


' R y, R z) = ( 2 lo ® 


1 /l + /3snr<A 1 


1 + /3snr, 


> o lo S 1 + 


(l-a)snr M \ 1 


+ 7 lo S 1 + 


(a ~ /3)snr 2 
1 + (3snr z 


(142) 


1 + Q:snr u 

for some 3 G [0, a]. These rate pairs are obtained by a 3-layer superposition code sequence. The first 
layer, of power 1 — a, is an optimal Gaussian code sequence for the unintended receiver at snr u , the 
second layer is of power a — /3 to Z, and the last layer is a secrecy capacity achieving code sequence 
of power (3 designed for receiver Y with eavesdropper Z. These rate pairs have the following MMSE 
behavior: 


MMSE(X; 7 ) = l 


i 

1 + 7 ’ 


_ J 1 + 0:7 5 


P 

1+/37’ 

0 , 


7 G [ 0 , snr u ) 

7 G [snr u , snr 2 ) 
7 G [snr 2 , srir^) 
7 > snr y 


and 


1 

1+7 


MMSE(X; 7 |VK y ) = ^ 


7 G [0, snr u ) 

7 G [snr u , snr 2 ) 
0 , 7 > snr z 


(143) 


(144) 


and MMSE(X; 7 |W Z ) is as depicted in Theorem 8. 


From the above two theorems we can clearly see the different effects the constraint can have. In 
Theorem 11 the constraint has no effect on the rate to the weaker receiver and only the wiretap code 
sequence has to be altered in order to comply. In Theorem 12 the situation is reversed. In both cases a 
simple supeiposition scheme maximized the rate under the constraint. 

An example for the above results is depicted in Figures 6 and 7, where we show the reduction of the 
capacity region as compared to the case with no MMSE constraint. 
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Fig. 6. In the above figure we compare the capacity region of the Gaussian BCC with no MMSE constraint (in blue) with the 
capacity region given the MMSE constraint (in red). In this example snr„ £ [0,snr,). 


B. Proofs for the Gaussian BC and BCC with MMSE Constraints 

Proof of Theorem 10: We assume snr u £ (snr 2 , snr y ). We arbitrarily set R, as follows: 

R, = iiog( 1+ ; nr O 

2 & \ 1 + fism z / 

for some j3 £ [0,1]. In the proof of Theorem 6 we have shown that 


(145) 


I(X;Z\W z )<I(X;Z)-R z . 


(146) 


From which we concluded (using Theorem 1) that 

MMSE(X; sm z \W z ) < - 

1 + psnr 2 

Due to Theorem 5 we can also conclude that 


MMSE(X;snr 2 ) < 


P 

1 + /3snr 2 ' 


(147) 


(148) 
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Fig. 7. In the above figure we compare the capacity region of the Gaussian BCC with no MMSE constraint (in blue) with the 
capacity region given the MMSE constraint (in red). In this example snr u £ (snr 3 ,snr y ). 


Note that if /? < a we comply with the additional MMSE constraint according to Theorem 1 and thus 
the region is as in (70). The interesting question is what happens when /3 G (a, 1], In this case we have 
two MMSE constraints at snr u and snr z and thus can use [5, Theorem 5] and conclude that 


I{X-Y)< 1 - log 


/ 1 + snr z 1 + (3snr u \ 
\1 + (3 snr 2 1 + asm U J 


+ -log(l + asnr y ) 


Due to the data-processing inequality 


(149) 


R z + R y <I(X;Y) 


(150) 


and thus, 


R„ < -log 


1 /l + ^snr t 


1 + asnr. 


+ -log(l + asnr y ) 


1, /, . 1, A (B — a) snr t 

= 2 + asm v) + 2 log ( 1 + -- 


1 + asnr u 


( 151 ) 
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This rate pair is attainable using a 3-layer Gaussian supeiposition code sequence. The message to be 
decoded from Z has 1 — /? of the power, the second layer can already be reliably decoded at snr. a and 
has power of (5 — a, and the final layer to be decoded from Y has power of a. Each layer can be 
reliably decoded by fully decoding and removing previous layers and considering all subsequent layers 
as additive Gaussian noise. 

The case of snr„ < snr ; , is a bit more complex. Consider first the problem of maximizing I (X: Z) 
given the MMSE constraint at snr u . This follows [5, Theorem 3]: 


I(X;Z)< 1 - logf 1 + SPr - 
V ’ J ~ 2 & V l + asnr„ 


+ -log(l + asnr 2 ). 


Thus, we consider the following arbitrary rate of transmission of W z : 

R* = ^log ( 1 + snr ' u \ ~log(l + asnr,) - ^log(l + f3snr z ). (153) 

2 \1 + asnr, u y 2 2 

When /3 = 0 we obtain the maximum possible rate of W z . As for the upper bound on (3, we note that 
the rate must be non-negative, thus 

a ^ (1 — a)snr u + asnr 2 (l + snr u ) _ a /1</n 

P ^ — Pmax- 

snr 2 (l + asnr u ) 

Following the approach in the proof of Theorem 6 we can conclude from the above setting that 


MMSE(X; sm z \W z ) < 


1 + f3sm z 


Due to Theorem 5 we can also conclude that 


MMSE(X;snr 2 ) < 


1 + snr z f3' 


This constraint is valid only when /3 < a, otherwise it is redundant (as the constraint at snr„ already sets 
a stronger upper bound at snr 2 ). Thus, we split the analysis to two cases. First 3 G [a, /3 m ax]- In this case 
we use [5, Theorem 3], meaning maximizing the mutual information given a single MMSE constraint at 


R y + R z <I(X-,Y) 


< hoe f 1 + snr “ 

2 \l + asnr t 


+ -log(l -j-asnr^) 



and thus 
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R «-5 log (TT5S) + 5 log<1 + /Js " ^ ' , ' (158) 

When 3 = /3 max we have the following rate pair: 

(R„, R,) = (|log o) (159) 

and is clearly achievable using the Gaussian superposition codebook as shown in [5, Theorem 3] (R z = 0). 
If /3 = q we have 

(R »’ Ri) = G' og(1+ “ snr >' ) - l'° s (itS;)) ■ (160) 


This is again achievable using a Gaussian supeiposition codebook sequence, where the first layer is of 
power 1 — a and can be reliably decoded by a user at SNR of snr u (and thus also by Z), while considering 
the second layer as additive Gaussian noise. The second layer is of power a, and can be reliably decoded 
from Y, once the first layer is reliably decoded and removed. 

Now, the rate pair we obtained: 


(Ry, Rj.) 


^log(l + asnrj / ) + ^log 


/1 + /jsnr 2 \ 
\l + asnr z / 


h„ g ( 1 + SI M 

2 \ 1 + asnr n / 


h°g( 1+gsnr ' N )) 

2 6 \ 1 + crsnr z / / 

(161) 


which given that 3 £ [a, /3 max ] is equivalent to 


(Ry, R z ) 


^log(l + asnr. y ) + A^log 


/ 1 + snr u 
\1 + asnr u 


(l-A)^log 


/ 1 + snr„ A A 
VI + crsnr u J ) 


(162) 


meaning that every point in this region can be obtained by time-sharing between the above two extreme 
cases of 3 = 3mnx and 3 = a. 

Second, consider the case of 3 £ [0, a). In this case we use [5, Theorem 5], meaning maximizing the 
mutual information given two MMSE constraint at snr„ and snr 2 : 


Ry + R z < I (X; Y) 


< -log 
- 2 6 


1 + snr u 1 + asnr, 
1 + asnr u 1 + /)snr 2 


+ -M 1 H- /3snr y ) 


and thus 


(163) 


Ry < -log(l + 3 sm y ). 


( 164 ) 
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The rate pair is then given by 


(Rj/ ■> Rz) — 


^log(l + /3snr y ),^log 


/ 1 + snr u 1 + asnr 2 \ \ 
V1 + «snr. a 1 + /?snr 2 / / 


(165) 


which can be obtained by a 3-layer Gaussian supeiposition code sequence: the first layer of power 1 — a, 
can be reliably decoded at snr„ while considering all subsequence layers as additive Gaussian noise. 
The second layer of power a — (3 can be reliably decoded by Z after reliably decoding the first layer 
and considering the third layer as additive Gaussian noise. The third layer of power f3 is decoded after 
reliably decoding and removing all previous layers. ■ 

Proof of Theorem 11: First, for (3 £ [0, a), from Theorem 8 we can see that an optimal rate pair 
complies with the constraint. Thus, we can focus only on (3 £ [a, 1]. As we consider complete secrecy, 
given the MMSE properties of Corollary 3 we have the following expression for the rate pair: 


R 


R 




[MMSE(X; 7 ) - MMSE(X;7|FF y )] d 7 
[MMSE(X; 7 |VF 2 ) - MMSE(X; 7 |Wy)]d 7 


MMSE(X; 7 |VF 2 )d 7 


[MMSE(X; 7 ) - MMSE(X; 7 |FF 2 )]d 7 


(166) 


where the inequality is obtained when MMSE(X; 7 | Wy) = 0 for all 7 > snr 2 , that is, an optimally 
secure code sequence. Recall that (Corollary 3) 


MMSE(X; 7 ) = MMSE(X; 7 |FF 2 ), V 7 > snr 2 , 


(167) 


and thus the constraint at snr u £ (snr 2 , snr.y) is also a constraint on MMSE(X; 7 | W z ). Consider a function 
MMSE(X; 7 |FF 2 ) that satisfies the constraint and define f3 such that 

MMSE(X; 7 |FF 2 )d 7 = ^log(l + /3snr 2 ) (168) 

where (3 £ [a, 1]. By applying the “single crossing property”, given in Theorem 1, twice, first on (168) 
and second on the constraint MMSE(X; sm u \W z ) < 1+ ^ t snr we obtain the following upper bounds on 
the rates: 

1 r sm v 

R y <- MMSE(X; 7 |FF 2 )d 7 

^ J snr r 
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1 

< 

“ 2 


1 

2 _ 

snr 


i /'snry 

I MMSE(X; 7 |W 2 )d7+- / MMSE(X; 7 |FF 2 )d 7 

snr z ^ J snr u 


< - 
” 2 


snr 2 

]_ /*snr z 


^d 7+ x r 

l + /3 7 r 2 


MMSE(X; 7 |M4)d 7 


/? 


snr 2 


1 + /3 7 d7 + 2 


snr„ 

1 r sm y 


a 


' snr„ 


1 + cry 


d 7 


= ~log(l + /3snr. u ) - ^log(l + /3snr 2 ) + ^log(l + asnr^) - ^log(l + asnr„ 


(169) 


We also have 


r- _ i r 

2 7o 


[MMSE(X; 7 ) - MMSE(X; 7 |W 2 )]d 7 


< ~log(l + snr 2 ) - ~log(l + /3snr 2 ). 


(170) 


The achievability of the above rate pair follows simply by using a supeiposition code sequence. Consider 
X = V + U 1 + U 2 , where V, U\ and U\ are three independent Gaussian codebook sequences. V is an 
optimal Gaussian codebook sequence to Z of power I — 3. U \ is a secrecy capacity achieving Gaussian 
code sequence designed for reliable decoding at snr l( with the eavesdropper Z and power f3 — a. Finally, 
U-) is a secrecy capacity achieving code sequence for Y and eavesdropper Z and power a. The decoding 
of each layer is possible by fully decoding the previous layers and considering any subsequent layers as 
additive Gaussian noise. ■ 

Proof of Theorem 12: First note that by complying with the constraint we have the following upper 
bound: 


MMSE(X; 7 ) < 


a 


V 7 > snr. u . 


1 + a 7 

Due to the decoding requirement at Z this translates also to an upper bound on MMSEfW; 7 1ITT 


(171) 


MMSE(X; 7 ) = MMSE(X; 7 |FF 2 ) < 


a 


1 + cry 


, V 7 > snr 2 . 


(172) 


Denote 


MMSE(X;snr 2 ) = MMSE(X; snr 2 |W 2 ) = 


P 


1 + /?snr 2 


(173) 


If f3 > a we cannot comply with the constraint at snr u , and so we consider /3 G [0, a}. Due to the “single 
crossing point” property this choice will also comply with the upper bound for all 7 > snr 2 . Moreover, 
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this provides a lower bound on the following integral: 


1 

2 



MMSE(X; 7|W4)d7 > 


P 


'o 1 + Pi 


d7 


-log(l + /?snr 2 ). 


(174) 


Moreover, we have a direct upper bound on R (/ , namely 


1 r sm y \ 

R;y = 2 / MMSE(X; 7 )d 7 =- 


MMSE(X; 7 |^)d7 


2 V1 + /3snr 2 


(175) 


It remains to upper bound R, under the constraint 


R, = \ I™' [MMSE(X; 7 ) - MMSE(X; 7 |W*)]d 7 
1 Jo 

1 f srvr ~~ 1 

<-/ MMSE(X; 7 )d 7 --log(l+ /3snr.) (176) 

z Jo 1 

which is equivalent to maximizing the mutual information I (X;Z) subject to an MMSE constraint at 
some lower SNR. Note that we also need to consider the additional constraint we set at (173); however 
we will disregard this constraint and see that due to the choice of ft G [0, a] we get this constraint “for 
free”. Thus, the remaining problem is exactly the problem investigated in [5] and the solution is 


R z < ^log(l + snr. u ) + ^log 

= hog (i + (1 ~ a)snr " 
2 \ l + asnr u 


1 + asnr~ 
1 + asnr„ 


1 

2 


log(l + /?snr z ) 





(a - /3)snr z \ 

1 + /3snr 2 ) ' 


(177) 


The above rate pair can be obtained by a 3-layer supeiposition code sequence. The first layer, of power 
1 — a, is an optimal Gaussian code sequence that can be reliably decoded at snr u while considering all 
subsequent layers as additive Gaussian noise. The second layer is of power a — /? and can be reliably 
decoded by Z after reliably decoding the first layer and considering the subsequent layer as additive 
Gaussian noise. The last layer is a secrecy capacity achieving code sequence of power /? designed for 
receiver Y with eavesdropper Z. ■ 


VII. Summary and Conclusions 

In this work we have considered several mutli-user scalar Gaussian settings in the unified framework 
of the scalar Gaussian channel. Under this framework the Gaussian BC, the Gaussian wiretap channel, 
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the Gaussian BCC and the Gaussian BC and Gaussian BCC with MMSE disturbance constraints have 
been defined using the requirements at different points on the SNR axis. Using the I-MMSE relationship 
we have proved MMSE properties required from any code sequence complying with these requirements. 
In some cases these properties completely define the specific family of code sequences. 

More specifically, for the Gaussian wiretap channel, where apart from a reliable decoding requirement 
we also have a confidentiality requirement, we have shown that some well known “rules of thumb” 
used in achievability schemes are both necessary and sufficient for any “good” code sequence. The first 
being that, in any d op t code sequence, given the secret message the eavesdropper can fully decode the 
transmitted codeword. The second is that any e/ max code sequence is built on a point-to-point achieving 
codebook sequence and that any partitioning of this codebook sequence to bins defines a completely 
secure message if and only if the bins themselves arc codebook sequence that achieve capacity to the 
eavesdropper, that is, saturating the eavesdropper. It is also worth emphasizing that the rate-equivocation 
pair (C, d max ) is an achievable pair for the Gaussian wiretap channel [7], [8], meaning that point-to-point 
capacity does not need to be compromised for the sake of maximum confidentiality, as shown in [9, Ex. 
c, pp. 413] where the message is split to a completely secure message and an additional private message 
to obtain the entire rate-equivocation region. 

The Gaussian BC has a unique and well-defined MMSE behavior for any “good” code sequence. This 
behavior holds regardless of the implementation, meaning the same behavior for both a supeiposition 
code sequence and a “dirty paper” code sequence. Not surprisingly, “bad” code sequences, that do not 
achieve capacity, are less defined, but a necessary and sufficient condition for the reliable decoding of the 
message to the weaker receiver requires that the conditioned MMSE (conditioned on that message) equals 
the MMSE of the codeword from the SNR of reliable decoding and onwards. This is not a surprising 
result but is worth mentioning due to the fact that strict concavity does not hold here, as opposed to 
any finite n [34, Theorem 2], Finally, we derive necessary and sufficient conditions for a “good” code 
sequence, in terms of the MMSE and conditional MMSE behavior at all SNRs. 

The Gaussian wiretap channel and the Gaussian BC come together to define the Gaussian BCC. In this 
setting we have examined two cases: complete secrecy and optimally secure code sequences. We first 
observed that “good” code sequences under both requirements have the same codeword MMSE as the 
Gaussian BC. The difference between the cases and compared with “good” sequences for the Gaussian 
BC is the behavior of the conditional MMSE quantities. For both problems we have provided an I-MMSE 
based converse proof to the capacity region. 

Finally, we have considered also the effect of additional MMSE constraints at some other SNR. This 
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continues the work in [5] where such constraints were originally examined, and provided interesting 
insights to the two-user Gaussian interference channel problem. Given the MMSE constraint we have 
examined both the Gaussian BC and the Gaussian BCC. In both cases it was necessary to distinguish 
between two possible regions for the SNR of the MMSE constraint. We have provided a capacity region 
proof for both problems and in the BCC case also a full depiction of the behavior of the different MMSE 
quantities. We have observed that as in [5] the capacity achieving approach is supeiposition, however 
here we sometimes require a 3-layer supeiposition code sequence. Moreover, in the Gaussian BCC, 
depending on the SNR of the constraint, the second layer is either an optimal Gaussian code sequence 
to the eavesdropper (when the SNR of the constraint is lower than the SNR of the eavesdropper) or a 
Gaussian secrecy capacity achieving code sequence designed for a legitimate receiver at the constraint 
SNR and the eavesdropper (when the SNR is between the SNR of the eavesdropper and the legitimate 
receiver). 

Appendix 

A. Applying Fatou ’s Lemma 

In this section we formally prove (12). Examine the following sequence: 

iMMSE^-^p) - MMSE (C ’ /) ( 7 ) s . up |. (178) 

Note that by definition of the limsup (equivalent to (13)) we have that 

limsup|MMSE( C " / »)( 7 ) - MMSE (c ^( 7 ) snp | = 0. (179) 

n—>oc 

Using the reverse Fatou’s Lemma [35] (the MMSE function is bounded for all n ) we have that 


limsup / MMSE (C "^")( 7 ) — MMSE^ c ’'^( 7 ) sup |d 7 < 


/ limsup |MMSE^)( 7 ) - MMSE( c ^( 7 )^ p | = 0. (180) 

J n—>oo 

Thus, due to the non-negativity of the integrand we can conclude that 

limsup /'MMSE (a * / " ) ( 7 )d 7 = [ MMSE (c ^( 7 ) sup d 7 . (181) 

n—»oo J J 

On the other hand we assume that all normalized information quantities are stable (i.e., converge) meaning 


lim -I (X n ; \/smX n + N n ) = limsup -I (X n -,y/smX n + N n ) . 

n—»oo n n - s.oo n 


( 182 ) 
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Due to the I-MMSE relationship [29] we have that 

psnr psr\r 

lim / MMSE^ c '" , ^^( 7 )d 7 = limsup / MMSE (C,,1, ^( 7 )d 7 . 

n—>°o J Q n—Too Jo 

Putting (181) and (183) together we have that 

p snr psm 

lim / MMSE (C " / » ) (7)d7 = / MMSE^fr)*^. 

n ^°° Jo Jo 

B. Proof of Theorem 1 
Proof: We define 

^x„ iU ,7 {iJn) = (X n , u — E { X n ^ u | y n }) [X n ^ u — E {X n , u \ y n }) \y n | 

Note that 


(183) 


(184) 


(185) 


Ex„( 7 ,«) = E{*x„, U) 7 (y„)} (186) 

where the expectation is over Y n . The following proof is similar in nature to the extension to the 
conditioned case discussed in [30]. It follows the proof of [30, Theorem 1], We only emphasize the 
extension required for the conditioned case. In [30, Lemma 3] it was shown that 

D^q(X n \U = u, a 2 , 7 ) = -Tr (E {* x „ u , 7 (r„) 2 }) - 9 , 2 - (187) 

n (l + 7cr-) z 

Taking the expectation over U and using the lineality of the expectation and the trace function we have 
that 

D,q{X n \U,o 2 ^) = -Tr(E{$ x „ |C/ , 7 (r n ) 2 }) - / (188) 

n (1 + r yo z ) z 

We can now follow the derivation done in [30, equations (20)-(24)] and conclude that for all q(X„\U, o 2 , 7 ) < 
0 the derivative with respect to 7 is positive. Following the proof of [30, Theorem 1] we conclude the 
proof. ■ 


C. Theorem 1 as n —> 00 

In this section we show that the “single crossing point” property given in Theorem 1 holds also for 
lirninf q(X n \U, a 2 , 7 ) = lim ( inf q(X m \U, a 2 , 7 )] . (189) 

n—>00 n—>00 \ m>n J 
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As such we can conclude a “single crossing point” property between the i.i.d. Gaussian input and the 
lim sup M MSE function, whenever the limit of the MMSE function does not exist (and with the lim M MSE 
when it does exist). For this puipose we show that, for any natural n, the function 

inf q(X m \U,a 2 ^) (190) 

m>n 

has at most a single crossing point. Since this function is monotonically increasing in n (and bounded, 
so that it has a limit), the fact that it has at most a single crossing point suffices to conclude at most a 
single crossing point for the limit. Recall that for every natural n q(X n \U, a 2 . 7 ) has at most a single 
crossing point, meaning that if for some 7 

inf q(X n \U,a 2 ,'y) > 0 (191) 

m>n 

the crossing point has occurred for all q(X m \U, a 2 , 7 ), m>n , and we can conclude that 

inf q(X n \U, a 2 , 7 ') > 0, V 7 ' > 7 , (192) 

m>n 

meaning that no transition from a non-negative value to a negative value can occur; hence there is at 
most a single crossing point. 
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